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ABSTRACT 

We combine HST imaging from the GEMS 1 survey with photometric redshifts from COMBO-17 
to explore the evolution of disk-dominated galaxies since z < 1.1. The sample is comprised of all 
GEMS galaxies with Sersic indices n < 2.5, derived from fits to the galaxy images. We account fully 
for selection effects through careful analysis of image simulations; we are limited by the depth of the 
rcdshift and HST data to the study of galaxies with My < —20, or equivalently log (M. / M.®) > 10. We 
find strong evolution in the magnitude-size scaling relation for galaxies with My —20, corresponding 
to a brightening of ~1 magarcsec -2 in rest-frame V-band by z ~ 1. Yet, disks at a given absolute 
magnitude are bluer and have lower stellar mass-to-light ratios at z ~ 1 than at the present day. As 
a result, our findings indicate weak or no evolution in the relation between stellar mass and effective 
disk size for galaxies with log (M/Mq) > 10 over the same time interval. This is strongly inconsistent 
with the most naive theoretical expectation, in which disk size scales in proportion to the halo virial 
radius, which would predict that disks are a factor of two denser at fixed mass at z ~ 1. The lack 
of evolution in the stellar mass-size relation is consistent with an "inside-out" growth of galaxy disks 
on average (galaxies increasing in size as they grow more massive) , although wc cannot rule out more 
complex evolutionary scenarios. 

Subject headings: galaxies: spiral - galaxies: evolution - galaxies: high redshift - surveys - cosmology: 
observations 



1. INTRODUCTION 

The last eight billion years have witnessed strong evo- 
lution of the disk galaxy population. Both 'archaeolog- 
ical' studies of local disk-dominated galaxies and 'look- 
back' studies of the evolution of disk galaxies suggest 
a steady build-up in their stellar masses since z ~ 1 
(Rocha-Pinto et al. 2000; Flores et al. 1999; Bell et al. 
2005; Hammer et al. 2005). Insights into how this growth 
occurs are accessible through the study of disk galaxy 
scaling relations, such as the luminosity-rotation veloc- 
ity (Tully-Fisher) relation or the luminosity-size relation 
(e.g. Vogt et al. 1996; Lilly et al. 1998; Simard et al. 
1999; Bohm et al. 2004). Yet, owing to sample size 
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limitations, selection effects, and differences in analysis 
techniques, these studies have come to widely divergent 
conclusions. In this paper, we explore the evolution of 
the luminosity-size and stellar mass-size relations over 
the last 8 Gyr (since z ~ 1) using a sample of almost 
5700 disk-dominated galaxies from the HST GEMS sur- 
vey (Galaxy Evolution from Morphology and SEDs Rix 
et al. 2004). 

In the Cold Dark Matter (CDM) picture of structure 
formation, collapsing dark matter perturbations acquire 
angular momentum through tidal torques and mergers 
(Peebles 1969; Mailer et al. 2002; Vitvitska et al. 2002). 
Some fraction of this angular momentum is conserved, 
leading to the formation of cold, rotationally-supported 
disks. The typical magnitude of the specific angular mo- 
mentum predicted in this framework leads to the forma- 
tion of present day disks with approximately the correct 
distribution of radial sizes, if the specific angular momen- 
tum of the gas is similar to that of the dark matter and 
is mostly conserved during the formation process (Fall & 
Efstathiou 1980). 

A difficulty is that this idealized picture does not corre- 
spond to the outcome when the process of galaxy forma- 
tion is simulated in detail within the cosmological con- 
text of CDM. In hydrodynamical simulations, the gas 
tends to lose a large fraction of its initial angular mo- 
mentum, resulting in disks that are too small compared 
to observed nearby galaxies (Navarro & White 1994; 
Sommer-Larsen et al. 1999; Navarro & Steinmetz 2000; 
D'Onghia & Burkert 2004). Furthermore, very few 'late- 
type' disks are formed in such simulations: galaxies tend 
to suffer mergers that thicken and destroy their disks 
(Steinmetz & Navarro 2002). It is not yet established 
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whether this problem represents a fundamental difficulty 
with the 'standard' CDM paradigm (i.e., a result of ex- 
cess small scale power), a reflection of our incomplete 
ability to understand and simulate the complexities of 
star formation and supernova feedback, or inadequacies 
in numerical resolution. 

Many proposed solutions to this problem involve delay- 
ing gas collapse and disk formation to later times, cither 
by adopting an alternate power spectrum with reduced 
small scale power (such as Warm Dark Matter) , in which 
structure formation occurs later (e.g. Sommer-Larsen & 
Dolgov 2001), or by invoking some form of feedback that 
prevents the gas from cooling until relatively late times 
z ~ 1 (Weil et al. 1998; Thacker & Couchman 2001). 
While these solutions would be consistent with an impor- 
tant build-up in the disk galaxy population at late times, 
the late formation times implied by these models may be 
in conflict with the old ages of disk stars in the Milky Way 
and M31 (Rocha-Pinto et al. 2000; Ferguson & Johnson 
2001). Additional constraints can be gleaned from so- 
called 'backwards evolution' models, in which the ages 
and metallicities of the stars in present-day disk galax- 
ies are used to constrain the formation history of differ- 
ent components within our and other galaxies (Chiappini 
et al. 1997; Boissier & Prantzos 1999). Direct measure- 
ments of the mass-size scaling relations and radial size 
distributions of disk galaxies at earlier epochs will pro- 
vide an important counterpoint to these arguments by 
providing direct constraints on the angular momentum 
content of stars at these earlier times. 

A number of previous studies have used the Hubble 
Space Telescope (HST) to quantify the evolution of disk 
galaxies by measuring their absolute sizes and magni- 
tudes as a function of rcdshift. Magnitude and size are 
strongly correlated; a line of constant surface brightness 
falls almost parallel to the distribution of observed galax- 
ies, making the evolution of galaxy surface brightness a 
natural choice for parameterizing the evolution of galaxy 
sizes. However, the results of studies measuring average 
rest-frame surface brightnesses as a function of rcdshift 
have proven controversial, ranging from detecting no evo- 
lution to rather strong evolution in the range of 1-2 mag 
arcsec -2 brightening by redshift z ~ 1. For example, 
Lilly et al. (1998) found an average increase of the sur- 
face brightness of ~ 1 mag by redshift z <~ 1. This result 
is supported by observations of galaxies at high redshifts 
(z ~ 2 — 3), detected in very deep ground-based near- 
infrared images (Labbe et al. 2003). Trujillo et al. (2004) 
estimate that the average rest-frame surface brightness 
of these objects is more than 2-3 mag arcsec -2 brighter 
than in the local universe. 

Simard et al. (1999) pointed out that selection effects 
play a crucial role in such analyses. After accounting 
for the different sources of incompleteness, Simard et al. 
(1999) and Ravindranath et al. (2004) argue that the 
luminosity-size relation of disk galaxies evolves by less 
than 0.4 mag arcsec -2 over the interval 0.25 < z < 1.25. 
Yet, in order to reproduce the observations, both groups 
found it necessary to introduce a new population of high 
surface brightness galaxies in the highest redshift bin 
(z <~ 1). A different interpretation was suggested by Tru- 
jillo & Aguerri (2004), who find strong evolution of the 
average rest- frame F-band surface brightness of <~ 0.8 
mag arcsec _ 2 at a rcdshift z <~ 0.7, also including a full 



treatment of completeness. 

In this work, we present the results from a new sample 
of disk-dominated galaxies from the GEMS survey. Each 
of our galaxies has a spectrophotometrically-measured 
rcdshift, a spectral energy distribution (Wolf et al. 2004, 
SED), and a stellar mass estimate (Borch 2004) from 
COMBO-17 . We combine these SED constraints with 
light-profile shapes and sizes determined from deep high- 
resolution HST Advanced Camera for Surveys (ACS) im- 
ages. We reassess the evolution of the magnitude-size 
and stellar mass-size relation as a function of rcdshift 
over the range 0.1 < z < 1.1, taking particular care to 
model the impact of the selection function. We suggest 
a resolution to the conflicting previous results by pre- 
senting a coherent picture of strong surface brightness 
evolution with redshift without the need for a new pop- 
ulation of high surface brightness galaxies. 

The layout of this paper is as follows. In § 2 we present 
the GEMS data set and describe the sample selection, the 
galaxy fitting techniques and the corrections we applied 
to the data. We explain in some more detail our modeling 
of the sample completeness in § 3. In § 4, we explore the 
evolution of the magnitude-size and stellar mass-size re- 
lations for disk-dominated galaxies. We show that there 
is a trend of increasing average surface brightness with 
rcdshift and that there is little evolution of the surface 
mass density. In § 5 we discuss our results in comparison 
with previous studies in the literature, and compare them 
with theoretical expectations. We summarize our results 
in § 6. Throughout this paper we use the concordance 
cosmology with H = 70 km s _1 Mpc -1 , CIm = 0.3 and 
Oa = 0.7 (Spergel et al. 2003). Unless indicated other- 
wise we use Vega- normalized magnitudes. 

2. SAMPLE DEFINITION 

2.1. Imaging Data 

GEMS, Galaxy Evolution from Morphologies and 
SEDs (Rix et al. 2004), has imaged an area of — 800 
arcmin 2 centred on the Chandra Deep Field South 
(CDFS), using the ACS on-board HST. Of these 78 
ACS tiles the central 15 were incorporated from the 
GOODS project (Giavalisco et al. 2004). With integra- 
tion times of ~ 35 min in each of two filters (F606W 
and F850LP) the point source detection limits reached 
m AB (F606W) = 28.3 (5a) and m AB (F850LP) = 27.1 
(5a), respectively. Details about the image mosaic and 
data reduction will be explained in a subsequent paper 
(Caldwell et al. 2005, in prep.). 

2.2. COMBO-17 Data 

The HST imaging data is complemented by low reso- 
lution spectrophotometric data from COMBO-17 (Wolf 
et al. 2004). COMBO-17 has provided precise redshift 
estimates (cr z /(l + z) <~ 0.02) for approximately 9000 
galaxies down to < 24. Rest-frame absolute magni- 
tudes and colors, accurate to ^0.1 mag, are also available 
for these galaxies. Furthermore, using a simple param- 
eterized star formation history and the photometry in 
the 17 COMBO-17 bands, Borch (2004) computed stel- 
lar mass estimates for each galaxy in our sample, assum- 
ing a Kroupa et al. (1993) stellar initial mass function 
(IMF). These mass estimates are consistent with those 
derived using a one-color-based transformation from light 
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to mass as described in Bell & de Jong (2001) and Bell 
ct al. (2003). While such estimates suffer from uncertain- 
ties in the IMF, ages, dust, and metallicity, it is encour- 
aging to note that several studies (Bell et al. 2003; Drory 
et al. 2004) find good agreement between masses based 
on broad-band colors and those from spectroscopic (e.g. 
Kauffmann et al. 2003a, b) and dynamical (Drory et al. 
2004) techniques. 

2.3. Source Detection 

For source detection we use the S Extractor soft- 
ware (Bcrtin & Arnouts 1996) on the F850LP image. 
In contrast to the standard single-pass approach, we ap- 
ply a two-step process, running SExtractor twice on 
each image to find an acceptable compromise between 
deblcnding and detection threshold (see Rix et al. 2004) . 
Combining the source lists from each tile, taking care 
to remove duplicate objects that were detected in two 
neighbouring tiles, we end up with over 40,000 galaxies. 

2.4. Galaxy Fitting and Disk Selection 

For the purpose of this paper we wish to isolate the 
subset of galaxies whose light is dominated by a disk 
component. We start by identifying all galaxies that 
can be reasonably well-fit by any single Sersic profile 
(Sersic 1968) using the two-dimensional fitting code gal- 
fit (Peng et al. 2002). The Sersic profile is a general- 
isation of a de Vaucouleurs profile with variable Sersic 
index n: 



S(i?) 



x exp 



(R/Rc 



,1/rt 



1 



(1) 



where R e is the effective or half-light radius, S e is the 
effective surface density, £ (R) is the surface density as 
a function of radius and k = n (n) is a normalization 
constant. An exponential profile has n — 1 while a 
de Vaucouleurs profile has n = 4. GALFIT convolves 
Sersic profile galaxy models with the point spread func- 
tion of the ACS (Jahnke et al. 2004, Jahnke ct al., in 
preparation) and then determines the best fit by com- 
paring the convolved models with the science data using 
a Levenberg-Marquardt algorithm to minimize the x 2 
of the fit. The best-fit model is given by 7 parameter 
values and their associated uncertainties, including the 
half-light radius, the Sersic index and the total magni- 
tude. Initial GALFIT starting guesses for the model pa- 
rameters were obtained from the SExtractor source 
catalogues. Typically, neighbouring galaxies were ex- 
cluded from each model fit using a mask, but in the 
case of closely neighbouring galaxies with overlapping 
isophotes the galaxies were fitted simultaneously. The 
sky level for each galaxy was carefully measured using 
flux growth curves, masking out detected neighbouring 
sources. Lacking an estimate for the Sersic index from 
SExtractor, we started all fits with n = 1.5. In addi- 
tion, all galaxies with 0.65 < z < 0.75 were fitted with 
GIm2d (Simard et al. 2002). Estimates for magnitudes, 
sizes and Sersic indices from the two codes agree very well 
(see Bell et al. 2004; Haufiler et al. 2005, in prep.). Mor- 
phological quantities quoted in the present paper were 
derived using GALFIT. 

For this study, we estimate structural and morpholog- 
ical parameters from the z-band images (F850LP). In 
the optical (and the near- infrared) , young stars make a 



progressively smaller contribution with increasing wave- 
length. Therefore, galaxy morphologies in F850LP are 
smoother than those in F606W, leading to a more ro- 
bust detection and deblending of extended sources. The 
F850LP band corresponds to rest-frame R, V, and B- 
bands at z ~ 0.4, 0.7, and 1 respectively. 

Selection of a galaxy sample for this kind of study is a 
multi-step process. First we merge the GEMS catalogue 
with the COMBO-17 rcdshift catalogue, then we select 
disk-dominated objects, and finally we remove sources 
with poor fits (see Fig. 1). We start by matching the 
GEMS sources to the COMBO-17 catalogue. To account 
for the relatively high source density in the GEMS im- 
ages we pick the closest neighbour in the COMBO-17 
catalogue within 0.5 arcsec as the corresponding match 
for a GEMS galaxy. Only at matching distances exceed- 
ing 1 arcsec does one start to include uncorrelated pairs. 
We are left with about 8000 matched sources in our sam- 
ple. 

We isolate disk-dominated galaxies for further study by 
cutting the sample based on the Sersic profile fits. We 
adopt n = 2.5 as the dividing line between disk- (n < 2.5) 
and spheroid-dominated (n > 2.5) galaxies. This cut 
discriminated between visually-classified early- and late- 
type galaxies from GEMS with 0.65 < z < 0.75 with 
80% reliability and less than 25% contamination (Bell 
et al. 2004). This cut is also consistent with the anal- 
ysis conducted by the Sloan Digital Sky Survey (SDSS; 
see Shen et al. 2003). Furthermore, Ravindranath et al. 
(2004) have redshifted a sample of local galaxies to show 
that the Sersic index is still a useful indicator at redshifts 
z ~ 0.5 — 1. Selecting galaxies with n < 2.5 left us with 
<~ 6200 disk-dominated objects. 

To ensure that the extracted galaxy profile parame- 
ters are reliable we remove objects from our source list 
that have relative formal errors in Sersic index n and 
effective radius R e of more than 25% (5n/n > 0.25, 
6R e /R e > 0.25) 12 . We also exclude objects that reach 
the boundary conditions for n (0.2 < n < 8) or R e 
(0.3 < R e [pixel] < 500). Furthermore, we require that 
the GALFIT magnitudes coincide with the SEXTRACTOR 

magnitudes to within 0.6 ITlcig (|^GALFIT ^SEx 

+0.166| < 

0.6). Finally, we remove compact sources with log i?g PP < 
max [8 — 0.4 x m z , 0] (indicated by the solid line in the 
bottom left panel in Fig. 1). While slightly more galax- 
ies with low surface brightness were removed by these 
additional cuts than high surface brightness galaxies, no 
pronounced bias was introduced. It is important to note 
that the simulated galaxy samples were also subjected to 
these same cuts for the construction of the completeness 
maps; thus, the completeness maps account fully for any 
biases introduced by these (necessary) extra sample cuts. 

This sample selection should provide a fair represen- 
tation of the disk-dominated galaxy population at all 
redshifts. The final catalogue contains 5664 disk galax- 
ies with absolute rest- frame B- and F-band magnitudes, 
redshifts and stellar masses obtained from COMBO-17 
and apparent half-light radii and Sersic indices from gal- 
fit. 

12 GALFIT formal errors underestimate the true uncertainties, as 
assessed using simulated galaxy images. The true uncertainties for 
the bulk of the sample arc ~ 35% in R e and ~ 0.2 mag in m z . 
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2.5. The Local Comparison Sample 

In order to compare our measurements to a local refer- 
ence point we have opted to use the NYU Value-Added 
Galaxy Catalog (VAGC; Blanton ct al. 2004), which is 
based on the second data release (DR2) of the SDSS 
(Abazajian et al. 2004). It contains Sersic fits for 28089 
galaxies in the redshift range 0.0033 < z < 0.05. For this 
paper, we use the VAGC elliptical aperture Sersic fits 
for estimates of r^s-band half-light radius, Sersic index 
and -flux; coupled with extrapolated circular aperture 
u ab 7 9ab 7 r AB jiABi z ab fluxes. The magnitudes were 
converted to absolute galactic foreground extinction- 
corrected magnitudes using the latest K-Correct rou- 
tines, which were also used for the original data (Blan- 
ton et al. 2003). We apply the following correction to 
convert the SDSS elliptical half-light r^s-band sizes to 
rest-frame V-band (see § 2.6): R e (V) = R e (r) x 1.011. 
The redshift of the individual SDSS sources does not im- 
pact significantly on this correction factor. To obtain 
a rest-frame -B-band size for the SDSS galaxies we use: 
R e (B) = R e (V) x 1.017. 

We have chosen the VAGC rather than the fits to 
the magnitude-size and stellar mass-size planes by Shen 
et al. (2003) for various reasons. Using the VAGC we 
have full control over all estimated parameters includ- 
ing the photometric system, k-corrections, etc. Specifi- 
cially, the fits by Shen et al. (2003) where performed on 
circularized size estimates while we use elliptical Sersic 
measurements. The half-light sizes and absolute magni- 
tudes by Shen et al. (2003) were provided only in SDSS 
filters, necessitating the use of color transformations and 
of additional luminosity function convolutions in order to 
obtain mean values for the same selection and photomet- 
ric system as the GEMS data. Furthermore, the VAGC 
allows us to repeat the same analysis procedure that was 
also used for the GEMS data. Finally, the VAGC incor- 
porates the latest version of the SDSS pipeline, leading 
to more robust Sersic indices, fainter apparent limiting 
magnitudes and fewer problems with deblending large 
sources. Since the VAGC and the data used by Shen 
et al. (2003) have ~ 20, 000 sources in common we could 
verify that the measured parameters broadly agree with 
each other. 

The VAGC does not contain stellar masses. Therefore, 
we have used the prescription given in Bell et al. (2003) to 
convert a (g — r) AB color into a SDSS r^-band stellar 
mass-to-light ratio: 

log (M/L r ) = -0.306 + 1.097 x (g - r) AB - 0.15. (2) 

We have applied a correction of —0.15 to convert to a 
Kroupa IMF, in accord with our GEMS stellar masses. 
The stellar mass was then obtained from the following 
relation: 

log (M) = log {M/L r ) - 0.4 x (M r;S - r ) , (3) 

with the absolute rest-frame Sersic magnitude M r ^s = 
rs — 5 log (Dl) — 25, the apparent rest-frame Sersic mag- 
nitude rs, the luminosity distance Dl and the absolute 
magnitude of the sun r Q = 4.67 in SDSS tab- Calculat- 
ing a stellar mass in the same fashion for the lowest red- 
shift GEMS galaxies and comparing this estimate with 
the SED-based masses (Borch 2004) reveals no apparent 
systematic offsets. 



2.6. Rest-Frame V-band Sizes 

Galaxies are known to exhibit radial color gradients. 
As a result of this, galaxy sizes vary as a function of 
wavelength and the measured physical size evolution of 
the galaxy population could be skewed by the effects 
of band shifting with redshift. Therefore, we have not 
simply converted our apparent half-light sizes i?g PP mea- 
sured in the F850LP filter to a physical value, but instead 
have applied a color gradient correction to each individ- 
ual galaxy according to its redshift to correct the size to 
the rest- frame V-band. For a sample of local galaxies, 
de Jong (1996) presents the relative disk scale lengths, 
which for a pure disk corresponds to R e = 1.678 x R^, in 
the B-, V-, i?-, I-, H- and iVbands. Figure 2 illustrates 
this ratio of the disk scale lengths in one band to the size 
measured in the V-band, as a function of the correspond- 
ing wavelength. A linear fit with the intercept fixed to 
1 at the V-band results in a slope of or = —0.184, cor- 
responding to correction factors varying by only ±3% 
over the whole redshift range. All future references to 
effective radii R e are to sizes corrected to the rest-frame 
V-band. 

In order to obtain rest-frame sizes for the SDSS data 
we have calculated the ratio of the circularized half-light 
sizes in the five SDSS bands, divided by the size in the 
SDSS ^s-band. We overplot the resulting values in 
Fig. 2, minimizing in a simultaneous fit the offset be- 
tween the SDSS points and the other V-band normal- 
ized measurements. The agreement between the various 
measurements is striking. This supports the validity of 
the average correction to obtain rest-frame sizes, bearing 
in mind the 20% galaxy-by-galaxy scatter, and that this 
method, strictly speaking, applies only to nearby galax- 
ies. 

Given the possible rapid evolution of galaxy disks in 
the last 8 billion years, it is not inconcievable that the 'av- 
erage' disk color gradient has evolved considerably since 
z ~ 1. In a subsequent paper we will reconstruct the 
rest-frame -B-band for individual galaxies and estimate 
sizes directly from this image to account for this effect. 
As an interim solution, we have tested the applicability 
of the local average relation on distant galaxies in GEMS. 
We have fit all GEMS galaxies in the F606W band using 
exactly the same approach used to fit in F850LP. Ow- 
ing to significant differences in the depth of the F606W 
and F850LP data, and F606W's extra sensitivity to on- 
going star formation, we consider the F606W fits at this 
stage to be preliminary 13 . From these fits we selected 
those sources for which one of the bands corresponds 
to the rest-frame V-band and measure the size ratio at 
z ~ 0.08 (F606W - V rcBt ) and at z ~ 0.64 (F850LP ~ 
V rost ). The average values from these measurements are 
overplotted in Fig. 2. They confirm the trend seen in the 
de Jong (1996) and SDSS data, supporting the validity 
of the correction we have applied to the data 14 . 

13 While many galaxy fits were reasonably successful, a non- 
ncgligiblc fraction of the fits are substantially in error. Thus, while 
on average, the F606W fits are reliable, it is impossible at this stage 
to use a weighted sum of the F606W and F850LP fits to directly 
estimate the rest-frame B- or V-band sizes on a galaxy-by-galaxy 
basis. 

14 It is worth recalling that the corrections implied by this rela- 
tion are rather small, ~ 3% for the average GEMS galaxy. Further- 
more, the evolution of average rest-frame V-band surface bright- 
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2.7. Completeness 

In order to estimate the limitations of the GEMS sur- 
vey we have performed extensive simulations of artifi- 
cial disk galaxy light profiles (sec Haufiler ct al. 2005, 
in prep.). By inserting a number of such artificial disk 
images with purely exponential profiles (Sersic index 
n=l), and subsequently re-running the source detec- 
tion and fitting process (including removal of bad fits 
according to § 2.4), we calculate our success rate: the 
completeness as a function of apparent effective radius 
i?* pp and apparent magnitude m z . It turns out that the 
contours of constant detection probability in the i?J pp - 
m z -plane (see Fig. 3) lie along lines of constant apparent 
surface brightness: 

liT = m z + 2.5 log(27rg) + 5 log(i^ pp / [arcsec]), (4) 

in the limit of bright magnitudes (q is the axis ratio) . At 
the faint magnitude limit, however, the lines of constant 
detection probability are at constant magnitude. The 
precise location of such a line depends also on the axis 
ratio of the objects. In the absence of dust, an object 
with high inclination has a higher detection probability 
than a source of the same apparent magnitude but viewed 
face-on. 

We model the detection probability as a function of 
the apparent magnitude. A double exponential model 
provides a good fit to the data (for a detailed description 
see appendix A). Both the shape and the characteristic 
magnitude limit at which a specific detection probability 
is reached depend on the apparent size and the axis ratio. 

Our final sample contains only the objects with red- 
shift estimates from COMBO-17 and therefore we must 
also account for the COMBO-17 completeness limit. 
Wolf et al. (2003) have calculated the completeness of 
COMBO-17 as a function of apparent i?-band aperture 
magnitude m Riaper , redshift and U — V color. In order to 
show the COMBO-17 completeness contours on Figs. 3, 
4, 6, 7 and 10, we statistically transform the COMBO-17 
completeness map into the m z — i? e plane (Appendix B). 
We adopt this analytic approximation to the COMBO- 
17 completeness in the rest of this paper, but note that 
the use of either the true COMBO-17 completeness map 
or the analytical mapping of the completeness maps onto 
the m z — R e plane in the analysis that follows does not 
affect our conclusions. 

We combine the GEMS detection probability and the 
COMBO-17 completeness by multiplying the two values 
for each individual object: 

P — PGEMS X PcOMBO-17- (5) 

We can now estimate the combined detection probability 
p of individual galaxies. Since later on we weight galaxies 
by the inverse of the detection probability we have taken 
special care when using very low detection probability 
values. In order to avoid attributing large weights to 
any given galaxy (which would then dominate the whole 
sample), we remove any object with p < 5% from the 
sample (a total of 14 sources). For the main analysis 
presented here, we only include objects with a detection 

ness is dominated by galaxies with z > 0.6, where the F850LP 
samples rest-frame V-band almost directly, and by the SDSS data 
at 2 < 0.05; thus, further reducing our sensitivity to any errors in 
the size correction. 



probability p > 50%. In appendix C we discuss in more 
detail how the detection probability will impact on the 
evaluation of the data especially in the magnitude-size 
plane, which is also the reason for not removing galaxies 
with 0.05 < p < 0.5 from the sample altogether. In Fig. 4 
we illustrate the resulting detection probability function 
in the i?g pp -m z -plane. 

The completeness p suss of the SDSS data is parame- 
terized as a function of surface brightness /Z5o, r and posi- 
tion on the sky RA (a) and dec (6) (Blanton et al. 2004): 

Psdss (M50,r, OL, 5) = fti (/i50,r) x fsp (V>50,r) x fph (fJ>50,r) x 

j got (oc,5), where fu is the "tiling" fraction, f sp is the 
spectroscopic completeness, f p h is the photometric com- 
pleteness and fg t is the fraction of main targets for which 
a classification was obtained in this object's sector, as de- 
scribed in more detail in Blanton et al. (2004). The re- 
sulting completeness as a function of surface brightness 
we present in Fig. 5 for the case f got {a, 5) = 1. Note 
that the rapid drop of the completeness at high surface 
brightnesses directly results from the improper deblcnd- 
ing of the largest nearby galaxies. We have approximated 
the data points given in Blanton et al. (2004) with the 
following analytical formula: 

„ / /M50 r — 23.6 \ \ 

Psdss = 0.99 xexpl - exp I '-^ I I 

x(l-exp(-ex P (^^l))) (6) 

In the subsequent analysis we only consider objects with 
a completeness Psdss > 0.5, in order to match the selec- 
tion of the GEMS galaxies. 

3. ANALYSIS OF COMPLETENESS AND SELECTION 
EFFECTS 

In the following sections we evaluate the magnitude- 
size and stellar mass-size relations as a function of red- 
shift. We have subdivided our sample of disk galaxies 
into five redshift bins, each of which spans a range of 0.2 
in redshift, centred on z = 0.2,0.4,0.6,0.8,1.0, plus an 
additional redshift bin at z <~ 0.0 for the SDSS data. 

In Fig. 4 we show the combined completeness map with 
observed disks with 0.35 < q < 0.65 overplotted. The 
galaxies in the sample form a relatively tight relation in 
the apparent magnitude-size plane. Inspecting the slope 
a of this relation one realizes that it is close to, but not 
exactly equal to a line of constant surface brightness. A 
linear fit provides a slope a ~ —0.15. In physical quan- 
tities this slope closely matches that of a line of constant 
volume density, i.e. a law such that the ratio of flux and 
the cube of the radius is constant (a = —0.13), rather 
than a line of constant surface density (a — —0.2). The 
fact that the slope does not match that of a constant 
surface brightness implies that measuring average sur- 
face brightnesses depends to some extent on the range in 
magnitudes over which the average is calculated. Thus, 
in order to quantify the evolution of the surface bright- 
ness one has to make sure that the same range of absolute 
magnitudes is observed at all rcdshifts. 

The sample becomes approximately magnitude-limited 
at m z ~ 23.5. This limit is imposed by the COMBO- 
17 redshifts; fainter galaxies cannot be assigned reliable 
rcdshifts. Furthermore, we find no galaxies at brighter 
magnitudes m z < 23 with detection probabilities less 
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than 50%. Since we show in Fig. 4 galaxies of all red- 
shifts, this implies that the GEMS data are not limited 
in surface brightness at any redshift, even at the highest 
bin. Therefore, our subsequent analysis is not affected 
by a possible completeness- induced truncation of the sur- 
face brightness distribution of the galaxy population at 
any redshift. We conclude that the combined GEMS 
+ COMBO-17 sample is essentially magnitude-limited 
only, with surface brightness playing a minor role. This 
conclusion is robust to the detailed choice of axis ratios. 

We have translated these completeness contours to the 
absolute magnitude-size plane in Fig. 6. To estimate 
the absolute magnitude, we fit a third-order polynomial 
to the "average apparent z minus rest-frame apparent 
V color" (m z — 77j y at ) of our sample as a function of 
redshift. Obviously a redshift dependence cannot fully 
model this color, leading to a small additional scatter of 
the data relative to the transformed completeness map 15 . 

In Fig. 6 we also overplot the SDSS completeness. At 
the low surface brightness edge a fairly large number 
of SDSS objects are found with very low completeness 
values; the VAGC does not sample the full distribution 
of surface brightnesses. We adopt an absolute magni- 
tude cut of My < —20 in this paper: brighter than 
this limit the size distribution is sufficiently narrow that 
the full range of surface brightnesses is well-sampled. In 
order to estimate where the apparent magnitude limit 
mj! m = 17.77 starts to affect the galaxy distribution we 
convert mj, lm into an absolute magnitude M v m = —18.8 
for the highest redshift in the VAGC using a color trans- 
formation for a typical Sbc (Fukugita et al. 1995). Again, 
this limit is fainter than our adopted absolute magnitude 
cut. 

Inspection of Fig. 6 shows that the sample reaches 
My < —20 in the highest redshift bin; therefore in what 
follows we restrict our analysis to this absolute magni- 
tude range at all redshifts. This selection leaves 3584, 
76, 176, 704, 671 and 559 disk galaxies in the respec- 
tive redshift bins z = 0.0,0.2,0.4,0.6,0.8,1.0; a total of 
3584 + 2186 galaxies. This magnitude cut implies that 
our results are applicable only over this brightness range. 

We have explored in detail the influence on the aver- 
age surface brightnesses and surface densities of varying 
the p > 50% criterion, the surface brightness range over 
which one averages, and the absolute magnitude range 
considered. The influence of the p cut is negligible; the 
surface brightness and magnitude ranges do affect the av- 
erage surface brightnesses/densities, and great care must 
be taken to choose appropriate integration ranges. These 
issues are discussed where relevant in §5, and in great de- 
tail in Appendix C. 

4. ANALYSIS OF THE MAGNITUDE-SIZE AND STELLAR 
MASS-SIZE RELATION 

For our subsequent analysis we define the absolute rest- 
frame effective surface brightness in the V-band as: 

liy = My + 5 log R e + 2.5 log q + 38.568, (7) 

with the absolute rest-frame magnitude in the V-band 
from COMBO-17 and the half-light radius R e in kpc. 

15 These transformed completeness maps arc not used in the 
science analysis; rather, they are included in the figures for presen- 
tational purposes alone. 



The constant 38.568 results from using sizes in kpc and 
luminosity distances in Mpc. Note that this formula is 
correct even for a general Sersic profile. In the analysis 
of the evolution of fiy we will only address the bright 
galaxy population with My < M v m = —20. Moreover, 
we define the "equivalent" absolute rest-frame surface 
mass density 

log = log M - 2 log R e - log {2nq) , (8) 

where the SED-estimated stellar galaxy mass M. is given 
in M.q. In the case of log E» we restrict the sample to 
galaxies with log {M/M Q ) > log7W lim = 10. We cal- 
culate average values of the surface brightness (/iy (z)) 
and the surface mass density (log Ex (z)), correcting for 
incompleteness by weighting indiviual galaxies by the in- 
verse of their detection probability as a function of red- 
shift. We obtain errors on the estimated mean values by 
performing an extensive Monte-Carlo analysis (see ap- 
pendix C). 

4.1. The Magnitude-Size Relation 

In Fig. 7 we present the magnitude-size relation for 
disk galaxies in six redshift bins extending to z ~ 1.1. 
We stress that the completeness contours shown in the 
figure are only indicative as they were calculated for a 
fixed axis ratio q = 0.5 and the central redshift of the 
corresponding bin (see also Fig. 3). Therefore, especially 
in the z — 0.2 redshift bin, we sec many galaxies "spilling 
over" into the incompleteness regions, which is a result 
of the non-negligible range of My cutoffs over redshifts 
0.1 < z < 0.3. To illustrate this effect we overplot 
vertical lines corresponding to an apparent magnitude 
m z = 24 at the centre, low and high end of each redshift 
bin (for z > 0.2). With increasing redshift (co- moving 
volume) the spread of the completeness becomes smaller. 
The detection probabilities for individual galaxies, how- 
ever, were calculated according to their exact magnitude, 
size and axis ratio and not relative to the plotted com- 
pleteness contours. In the case of the z ~ 0.0 redshift 
bin we only indicate the brightness level, below which 
the highest redshift galaxies are not fully sampled. 

As the completeness function limits us to detecting 
only the bright galaxies at high redshift, we limit our 
analysis to galaxies with My < —20. Recall also that 
we have demonstrated in §3 and Appendix C that we 
are not limited in absolute surface brightness even at the 
highest redshifts. Therefore, to evaluate the evolution of 
disk galaxies in the magnitude-size plane we have calcu- 
lated the average rest-frame absolute surface brightness 
(fly (z)) as a function of redshift including weighting of 
individual galaxies according to their detection probabil- 
ities. In Fig. 8 we show the weighted histograms of /iy (z) 
for each redshift bin. Indicated in each panel (at each 
redshift bin z ) are the estimated mean surface bright- 
nesses (/iy (zo)) together with the mean values of the pro- 
ceeding redshift bins (/iy (z < zq)) for comparison. This 
plot demonstrates clearly that there is a significant trend 
of increasing surface brightness with increasing redshift. 

We demonstrate how the mean surface brightness of 
the disk galaxy population evolves by plotting (/iy (z)) 
as a function of redshift in Fig. 9. Fitting a lin- 
ear function to the data we find an intercept and 
slope of (/iy (z = 0)) = 20.84 ± 0.03 mag arcsec~ 2 and 
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d {jiv (z)) /dz = —0.99 ± 0.06, respectively, thus an evo- 
lution of <~1 mag to z = 1. 

4.2. The Stellar Mass- Size Relation 

These stellar mass estimates allow us to investigate the 
evolution of the analogous quantity to the magnitude 
size relation: the stellar mass-size relation. Working in 
terms of stellar mass is useful not only because it is one 
step closer to the quantities actually predicted by the- 
ory, but also because it removes the evolution that is 
simply due to the aging of the stellar populations. We 
present the stellar mass-size relation in Fig. 10. Again, 
iso-density contours show the distribution of galaxies in 
the R e -M rest-frame plane. We use the same method 
as before to correct the size estimates to the rest frame 
y-band (it is important to note that ideally we would 
prefer to study stellar mass vs stellar mass weighted size, 
but we do not attempt this further correction here). 
As in the case of the average surface brightness (z)) 
we estimated the average stellar surface mass density 
(logS^vi {z)), as defined in cq. 8, for each redshift bin. 

We found that, as in the case with the surface bright- 
ness, the distribution of galaxies in the stellar mass-size 
plane does not fall exactly along a line of constant stellar 
surface mass density, but is of somewhat shallower slope. 
However, here the effect is much less pronounced (also 
due to the width of the distribution) and therefore, the 
precise cut-off in stellar mass, which is the equivalent of 
absolute magnitude, is not as important. 

Plotting mass as a function of magnitude for different 
redshift bins we find that log (M/Mq) — 10 is a good 
approximation of the limiting mass in the highest red- 
shift bin. In the calculation of (log Yjm (z)) we include 
the effects of completeness in exactly the same way as 
before, i.e. we compute (log Y>m (z)) using a cut in stel- 
lar mass log (M/Mq) > 10 and we weight galaxies with 
the detection probabilities derived from Fig. 3. 

In Fig. 11 we plot (log Y>m (z)) as a function of redshift 
and find that the average surface mass density, to first or- 
der, does not evolve significantly with redshift. The over- 
all data values are found within 8.44 < (logS^ (z)) < 
8.57. This is also illustrated in Fig. 12 where we plot 
the histograms of the stellar surface mass density for the 
individual redshift bins. The deviation of the lowest and 
the highest data point corresponds to only 34% in surface 
mass density. Fitting a line with constant slope zero to 
the data yields (log S^vi (z)) = 8.50±0.03. We stress that 
the validity of this estimate does depend strongly on sys- 
tematic errors in the measurement of the stellar masses. 
The error bars do not account for such effects and there- 
fore might present a somewhat oversimplied view. 

The constancy of the stellar mass-size relation above 
log (M/ Mo) ~ 10 since z ~ 1 comprises a strong con- 
straint on models of disk galaxy evolution. The simplest 
possible interpretation of the data is that galaxies grow 
inside-out: assuming that galaxies can only increase their 
stellar mass with time, in order to stay on the stellar- 
mass size relation as they grow in mass, galaxies must in- 
crease their scale- lengths accordingly. Yet, clearly, more 
complex and physically-motivated models will also be ca- 
pable of fitting the data. 

5. DISCUSSION 

5.1. Surface Brightness Evolution 



In order to facilitate comparison with previous stud- 
ies, we repeat the analysis in the rest-frame B-band (us- 
ing absolute -B-band magnitudes from COMBO-17 and 
correcting the sizes to £>-band). We convert the effec- 
tive surface brightnesses to central surface brightnesses 
assuming effective size and disk scale length scale as 
R e = 1.678 x R d : 

Mo,s = M B +5 log .Re - 5 log(1.678) + 38.568, (9) 

This is strictly true only for pure disk galaxies, but 
should be a reasonable approximation since the peak of 
our Sersic index distribution roughly coincides with the 
exponential case n <~ 1. As before, we find strong evo- 
lution in the rest £?-band surface brightness with red- 
shift. For the intercept and slope in the rest-frame B- 
band we find (fio,B ( z = 0)) = 21.11 ± 0.03 mag arcsec -2 
and -1.43 ±0.07, respectively (see Fig. 13). 

In contrast to this picture of strong evolution, several 
previous authors have found results consistent with weak 
or no evolution in the average surface brightness out to 
z <~ 1 (e.g., Simard et al. 1999; Ravindranath et al. 2004). 
In this section, we discuss how these apparently contra- 
dictory findings, based on similar data, can be reconciled. 

5.1.1. Are the Datasets Significantly Different? 

We can rule out differences in the datasets as the 
source of our divergent conclusions. Owing to the simi- 
larity of the datasets, we can reproduce the analysis of 
Ravindranath et al. (2004) in some detail. Ravindranath 
et al. (2004) assessed the average B-band central surface 
brightness of their sample as a function of redshift, lim- 
ited in surface brightness to /if,™ < 20.6. For both the 
GOODS and the GEMS data sets the Ravindranath et al. 
(2004) surface brightness limit implies removing half to 
two thirds of all galaxies at 0.25 < z < 0.50 that are 
detected above the absolute magnitude limit and have a 
measured redshift. Note that only <~ 5% of all galaxies 
were excluded at the highest redshift (1.00 < z < 1.25). 
Obviously, by using only one third of galaxies with the 
highest surface brightness, one introduces a strong bias 
in the measurement of (fto,B (z)) and the derived value 
will therefore not represent the average properties of 
disk galaxies at that redshift. Adoption of the surface 
brightness limit used by Ravindranath et al. (2004) yields 
very consistent results to theirs for (/j,o,b {z)) (right-hand 
panel of Fig. 13). For the GEMS data we find evolu- 
tion at less than the 0.4 mag arcsec -2 level using their 
selection criteria. As expected the high redshift data 
points are the least affected by their surface brightness 
limit. However, at lower redshift the results achieved 
using their selection criteria start to deviate systemati- 
cally from the analysis we presented earlier. Specifically, 
the lowest redshift point with the surface brightness cut 
is more than 10ct off the expected value (as estimated 
from our linear relation) without such a cut. Simard 
et al. (1999) adopted a very similar strategy, and also 
found very weak evolution, although in their case low 
number statistics are also an important source of uncer- 
tainty (there are only 5 and 6 galaxies in their lowest two 
redshift bins, respectively). 

5.1.2. Are the Analysis Techniques Different? 

We argue that the divergence between our conclusions 
and those of Simard et al. (1999) and Ravindranath et al. 
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(2004) is driven primarily by important differences in the 
analysis techniques. 

The analysis of Simard et al. (1999) and Ravindranath 
et al. (2004), justifiably imposed the selection function 
of high-redshift galaxies on the low-redshift galaxy pop- 
ulation, and asked whether the average surface bright- 
ness of galaxies which one could have in principle seen 
at z <~ 1 has evolved. Clearly, because of cosmological 
surface brightness dimming the bulk of nearby galaxies 
would be invisible if placed at z ~ 1, and arc omitted 
from consideration. One then finds little difference in the 
population of local galaxies that would be observable at 
z <~ 1. 

In this paper, we adopt a different approach. In 
essence, we step gradually outwards from low redshift 
to higher redshift, asking at each stage if there is any 
evidence that the results are significantly biased due to 
cosmological surface brightness dimming. The z ~ 
SDSS data are clearly not surface brightness limited for 
galaxies with My < —20. Stepping outwards to z <~ 0.2 
in the GEMS data, the surface brightness limits are well 
clear of the observed drop-off in galaxy number density 
for galaxies with My < —20. Similarly for z ~ 0.4, 
z ~ 0.6, and z <~ 0.8; at each redshift we have clearly de- 
tected both sides of the size distribution in a region where 
completeness is > 90%, and the observed drop-off is real. 
Thus, the observed evolution, at least out to z ~ 0.8, is 
a genuine property of the entire disk galaxy population 
and is unaffected by surface brightness dimming. 

At 2 ~ 1, it is less obvious that the data are well 
clear of the selection boundaries — we correct for incom- 
pleteness using the estimates obtained by applying our 
pipelines to artificial galaxies. Yet, even at the z ~ 1 bin 
we reach well beyond the peak of the surface brightness 
distribution (see Fig. 8), within the limits that we can 
confidently correct for incompleteness. Therefore, cither 
the evolution we measure in that bin is roughly correct, 
or the galaxy surface brightness distribution would have 
to be bimodal. In that case we could not observe a hypo- 
thetical second peak of low surface brightness galaxies. 
Furthermore, these galaxies would have to fade signifi- 
cantly (and faster than the "normal" galaxy population) 
with time, because otherwise we would detect these ob- 
jects at lower redshifts. So far there are neither observa- 
tional nor theoretical grounds on which to expect such a 
population of low surface brightness galaxies. 

It is worth noting that if the galaxy population did 
not evolve towards higher surface brightness at higher 
redshift, we would have seen that in the data, as the 
sample out to z ~ 0.8 is clearly deep enough to probe 
the entire My < —20 galaxy distribution in the high 
completeness region. 

5.2. A New Population of High Surface Brightness 
Galaxies at High Redshift? 

Simard et al. (1999) and Ravindranath et al. (2004) 
suggest that at z ~ 1 a distinct population of very 
high surface brightness galaxies emerges that is not de- 
tected at lower redshifts. Simard et al. (1999) describe 
these objects as sources with very high surface brightness 
Mo.s ^5 18 (they found 9 candidates; 18% of all galaxies 
detected at that redshift). Ravindranath et al. (2004) de- 
lineate this group of objects as compact (R e < 0.8 kpc) 
and bright (My < —21.5). Using their classification, 



< 5% of the galaxies at z ~ 1 fall into this category. 

One might conjecture that the introduction of a new 
population of high surface brightness galaxies at z ~ 1 
simply arises in order to interpret the increasing aver- 
age surface brightness within a global picture of a non- 
evolving (ho.b (z))- Our results suggest that the whole 
distribution of surface brightnesses shifts with redshift, 
naturally leading to a larger number of high surface 
brightness galaxies at higher redshift. Furthermore, we 
find no evidence that the surface brightness distribution 
changes its shape (at the 10% level, from inspection of 
Fig. 8). 

Interestingly, both Simard et al. (1999) and Ravin- 
dranath et al. (2004) introduce the appearance of this 
new group of objects just at the high redshift limits of 
their surveys. At those redshifts z ~ 1 — 1.2 their val- 
ues for the average surface brightness are generally in 
agreement with the GEMS data points. We have shown 
in Fig. 13b that we can reproduce the effect of a flat- 
tening in the evolution of (ho,b (z)) by introducing a 
hard upper surface brightness cut. However, the bulk of 
the remaining evolution appeared in the highest redshift 
bin (and hence one could propose the introduction of a 
new class of high surface brightness galaxies to account 
for this). After removing the highest surface brightness 
galaxies as classified by Ravindranath et al. (2004) or 
Simard et al. (1999), even the GEMS data do not show 
a significant redshift-dependent trend in ((j>o,b {z)) (see 
Fig. 13b). Although this line of reasoning appears to 
be consistent it nevertheless has a major drawback. At 
the lowest redshift the results should agree with the aver- 
age surface brightness obtained from the SDSS. This fact 
alone should raise strong concerns regarding the global 
sampling of the local galaxy population. Only strong 
evolution of (fio,B (z)) ~ 21.1 — 1.432 can account for 
both convergence with the local data point and the high 
redshift results from Simard et al. (1999), Ravindranath 
et al. (2004) and the results presented in this paper. 

To summarize, we believe that the weak surface bright- 
ness evolution found by Ravindranath et al. (2004) and 
Simard et al. (1999), and the emergence of a 'new pop- 
ulation' of high surface brightness galaxies at z <~ 1 re- 
sults from differences between their analysis technique 
— which imposes the high-redshift selection function on 
galaxies at all redshifts — and our analysis technique, 
which implicitly steps out gradually from the local to- 
wards the high redshift universe, asking if there is any 
evidence for the galaxy distribution running into the sur- 
face brightness detection limits. Applying the same se- 
lection criteria as Ravindranath et al. (2004), we also 
found weak surface brightness evolution. The disadvan- 
tage of this approach is that it cannot yield quantitative 
statements about the evolution of the global ensemble of 
disk galaxies, especially at low redshift. 

5.3. Comparison with Theoretical Expectations 

The basic picture of disk formation within a hierar- 
chical universe posits that the dark matter and gas are 
'spun up' by tidal torques in the early universe. The in- 
ternal angular momentum is generally characterized by 
the dimensionless spin parameter, A. Assuming that the 
gas does not suffer significant loss of specific angular mo- 
mentum during collapse, the size of the resulting disk Rd 
is expected to scale as Rd oc \r i} where rj is the radius 
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enclosing the gas before collapse (see e.g. Mo et al. 1998). 
In cosmological N-body simulations, it is found that the 
distribution of values of A for dark matter halos follows 
a characteristic log-normal form, and that the value of A 
does not correlate with halo mass, nor does the distribu- 
tion of A evolve with time (Bullock et al. 2001). Thus, to 
first order, we expect the size of a disk of fixed mass to 
scale with time in proportion to the virial radius of the 
dark matter halo: 



R d (z) = R d (0) x 



H(z) 



H(0) 



-2/3 



(10) 



where Rd (0) is the scale length at z — and H(z) is the 
Hubble parameter as a function of rcdshift (Mo et al. 
1998). Using the definition of the surface mass density 
cq. 8 we find: 



log H M (z) = log H M (0) + I log |^| , 



(11) 



with the surface mass density at redshift zero log (0). 
Since we are interested in the relative evolution only, we 
normalize the curve to the observed value log Ex (0) = 
8.5 and show the redshift dependence in Fig. 11. The 
expectation of this very naive model is that disks at z <~ 1 
should be a factor of two denser at fixed mass than they 
are at the present day, in clear contradiction with the 
observational results. 

In reality, however, we expect there to be several other 
competing factors. For example, the internal density pro- 
file of the dark matter halo, as commonly characterized 
by the concentration c, will also impact the final size 
of the disk, in the sense that halos with higher concen- 
tration will produce smaller, denser disks. The average 
concentration at fixed halo mass is a function of epoch, 
scaling as c oc (1 + z)" 1 (Bullock et al. 2001). Thus, the 
fact that halos were less concentrated at z <~ 1 by about 
a factor of two will tend to counteract the strong evolu- 
tion in surface density indicated above. As well, there 
are numerous other complications: there is certainly not 
a straightforward relationship between halo mass and the 
mass of baryons that collapse to form a disk; the specific 
angular momentum of the baryons that comprise the disk 
may not be equal to that of the dark matter halo; the 
disk size can be affected by the presence of a pre-existing 
bulge; and halos with low spin parameters and/or large 
disk masses may not be able to support a stable disk (Mo 
et al. 1998). In addition, a proper comparison of the pre- 
dicted evolution of the disk mass-size relation with the 
data requires a careful treatment of the observational se- 
lection effects. We defer this analysis to a future work 
(Somerville et al. in prep). 

6. SUMMARY 

Based on two-dimensional fits to the light profiles of 
all GEMS sources we have compiled a complete and un- 
biased sample of disk galaxies. Our disk sample was de- 
fined by its radial profile, specificially by Sersic profiles 
with concentrations lower than n = 2.5. COMBO- 17 
provided us with redshifts, rest-frame absolute magni- 
tudes and stellar masses for ~ 5700 sources. In order 
to compare the GEMS data to a local reference we have 
obtained the VAGC, containing the same information as 
provided by GEMS for - 28, 000 nearby (z < 0.05) SDSS 



galaxies. Inspecting the magnitude-size and the stellar 
mass-size relation for disk galaxies as a function of red- 
shift we have come to the following conclusions: 

• At high redshifts z ~ 1 the GEMS survey is complete 
only for galaxies with absolute magnitudes My < —20 or 
stellar masses log (A4/A4q) > 10. In order to properly 
address the potentially severe biases that arise when one 
attempts to explore the evolution of the galaxy popula- 
tion over this redshift range, we have computed a detailed 
2-dimensional selection function and introduced a lower 
limiting absolute magnitude cut. 

• Treating completeness and selection effects carefully, 
we find that the average surface brightness of disk galax- 
ies increases with redshift, by about 1 magnitude from 
z <~ 1 to the present in the rest-frame V-band. 

• The values calculated in our study are consistent at 
the high redshift end with the results of Ravindranath 
et al. (2004) and Simard et al. (1999) and at the low red- 
shift end with the value estimated from the SDSS VAGC. 
We have shown that the reasons the studies of Simard 
et al. (1999) and Ravindranath et al. (2004) reached 
rather different conclusions from our own (weak or no 
surface brightness evolution over the same redshift range) 
are primarily related to the way the data were analyzed, 
as well as to problems with small number statistics in 
the lower redshift bins. In particular, applying a hard 
lower surface brightness cut leads to removing substan- 
tial numbers of galaxies in the low redshift bins, and to a 
strong bias in the estimated value of the average surface 
brightness. This approach yields average surface bright- 
ness estimates at low redshift z <~ 0.2-0.4 that do not 
converge with the "zero redshift" results from SDSS. We 
confirmed that when we apply the same selection criteria 
to the GEMS data, we obtain results that are consistent 
with those of Ravindranath et al. (2004) . 

• In contrast to the conclusions of Simard et al. (1999) 
and Ravindranath et al. (2004), we find that there is 
no need to appeal to a new population of high surface 
brightness galaxies, which makes its appearance at high 
redshift. The increased number of high surface bright- 
ness galaxies at high redshift is a natural result of the 
surface brightness evolution that we have detected. 

• While the magnitude-size relation shows strong evo- 
lution with redshift, we show that the stellar mass-size 
relation stays constant with time. 

• The most naive theoretical expectation is that disks 
of fixed mass should be about a factor of two denser 
at z ~ 1, in clear contradiction with our results. Sev- 
eral competing factors probably conspire to produce the 
weaker evolution that we observe. 

• As the stellar mass of galaxies increases with time, 
the fact that the surface mass density does not evolve as a 
function of rcdshift implies that on average disk galaxies 
form inside-out, i.e. through increasing their disk scale 
lengths with time as they grow in mass. 
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APPENDIX 

PARAMETERIZATION OF THE DETECTION PROBABILITY 

The detection probability p for the GEMS data as a function of apparent magnitude to is well fitted by a double 
exponential function: 

/ I ' m — too 
Pgems = exp I - exp I — 



(Al) 



with the slope a and the characteristic magnitude toq. Both the slope a — a (i?g pp , q) and the characteristic magnitude 
to = to (Rl PP ,q) are a function of the apperant half-light radius R app [pix] and the axis ratio q. Since the smallest 
objects included in the simulations have half-light radii i?g pp > 0.3 pix we hold the GEMS completeness fixed at sizes 
R* pp < 0.5 pix: log R = max [log (R^ pp ) , 0.5]. The slope a is defined as: 

a = a (q) + <J X (q) X R (A2) 

with 



0.0860 + 0.118 x q 
0.308 - 0.0634 x q 



(q) 
<*\ (<l) 

The characteristic magnitude Too is defined as: 

TO =min[22.5 + TO((7) x log R + (7.37 - 1.83 x m(q)) x logi? 2 + (-3.44 + 0.60 x fh{q)) x logi? 3 
+ cos (7 log R - 1.75) x exp (-4.2 log R) + 5 log R + 2.5 log (q) , Mmax (q)] - 5 log R - 2.5 log (q) 

with 

to (q) = 5.325 + 5.373 x q - 2.128 x q 2 
/x max (q) = 29.80 + 0.0933 x q 

Similarly, we fit the detection probability for the COMBO-17 data by a double exponential function: 

'TO - TO , C S 



Pcombo-17 — n x exp 



exp 



(A3) 



(A4) 



(A5) 



(A6) 



Since the COMBO-17 detection probability does not depend on the axis ratio, both a c = a c (Rl PP ) and to 0jC = 
to , c (Rl PP ) take much simpler forms as functions of R* pp only. The slope a c is defined as: 

a c = 0.168 + 0.388 x 

2\ 



exp 



i fl" pp n 

Tog-|-,0 



2.131 



0.895 



(A7) 



and the characteristic magnitude too, c is defined as: 

to ,c = 23.85 - 0.274 x R + 0.507 x R 2 - 0.403 x .R 3 (A8) 
with R = max [logi?g PP , 0.3]. The normalisation n differs slightly from unity due to the effect of redshift focussing: 

n = 1.014 + 0.00112 x R (A9) 
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INCORPORATING THE Wolf et al. (2003) COMPLETENESS MAP 

The completeness map given in Wolf et al. (2003) contains values for the COMBO-17 detection probability 
PcOMBO-17 — PCOMBO-17 ( m R,apor) Z i (U - V) rest ) as a function of the apparent i?-band aperture magnitude ra Riaper , the 
rcdshift z and the (U-V) Ioal rest-frame color. In order to convert this completeness map into our magnitude- size frame 
Pcombo-17 = Pcombo-17 (m z ,Rl PP ), we take the following approach. 

We start with a simulated catalogue containing a uniform distribution of apparent z-band magnitudes, apparent 
half-light sizes (uniformly distributed in logi?g pp ) and redshifts. Then, we convert the GEMS z-band magnitude m z 
into a COMBO-17 total i?-band magnitude m R tot = m R tot (m z ,z). The following polynomial fit to the data is an 
adequate description (where RND denotes a normally distributed random number with a mean of zero and a standard 
deviation of 1): 

m R , tot =m z - [RND x (-0.308 + 0.0253 x m z ) 

-0.202 - 0.340 x z + 3.984 x z 2 - 13.881 x z 3 
+13.918 x z 4 - 4.264 x z 5 

+2.951 - 0.376 x m z + 0.0110 x m 2 ] (Bl) 

Next, we relate m R tot to m R , aper (the COMBO-17 completeness map is expressed in terms of m Riaper ). By assuming 
that the aperture loss for the disk galaxies in COMBO-17 is a function of the half-light radius. For the n < 2.5 disk 
sample we find a linear correlation between the difference of total and aperture COMBO-17 magnitude m R , tot — r7i R , apor 
and y/RT: 

m R , aper = m R , tot - 0.508 + 0.254^7?!™ - 0.226 x RND (B2) 

The scatter about this relation is only 0.23 mag. 

Finally, we estimate the COMBO-17 (U — V) color given the GEMS m z , redshift and size. We find that the following 
description, which is a function of m z and redshift, is an adequate representation of the data: 

(£/- V) rest =RND x 0.270 

+0.480 - 0.534 x z + 0.125 x z 2 

+2.417 - 0.107 x m z (B3) 

Using these transformations, we assign a COMBO-17 detection probability from the completeness map given in Wolf 
et al. (2003) to each mock GEMS galaxy, where the detection probability is a function of m z , i?g pp and z. We have 
compared the results of the modeling to the direct values from the Wolf et al. (2003) completeness map. Statistically 
the agreement is good and our subsequent conclusions are unaffected by which particular method is chosen. We have 
carried out the analysis using both methods, arriving at the same conclusions. In the paper we refer to our statistical 
approach in order to clearly demonstrate the fact that COMBO-17 is somewhat deeper in terms of surface brightness. 

ANALYSIS OF COMPLETENESS AND SELECTION EFFECTS 

In order to address the effects of our completeness correction and sample selection we take the following approach: 
For each redshift bin we calculate histograms of [iy (z) and log (z) using the inverse detection probability p of 
each object as a weight. From these "weighted" histograms we measure average values for the rest-frame absolute 
surface brightness in the V-band, (/iy (z)) and stellar surface mass density, (logEyn (z)). These average values and 
the corresponding errors we obtain by constructing 1,000 Monte-Carlo realisations of the GEMS data for each redshift 
bin. Each realisation consists of a random subsample of the whole data set containing as many sources as the full set, 
but allowing for duplicate data points. The adopted average values originate from the average mean value of the 1,000 
simulations, while the error bars in (fiy (z)) and (logS^vi (z)) were calculated from the scatter of the 1,000 mean value 
estimations. Using such a procedure, we are able to correct for galaxies missing in the GEMS survey down to the level 
where we can reliably estimate the detection probability p when calculating average mean values. 

The calculation of ([iy (z)) and (log (z)) is affected by three limitations. At some limiting magnitude m" m the 
detection probability p drops to zero. The same occurs at some limiting surface brightness ^* pp,lim . Both effects limit 
the range of absolute magnitude and surface brightness that is covered by the GEMS data. The higher the redshift, 
the brighter is the corresponding limiting absolute magnitude M|} m and the limiting rest-frame surface brightness /x'y m . 
As a result of this, we have to restrict the study of the average galaxy population at each redshift bin to the galaxies 
brighter than M|> m and fiy" corresponding to the highest redshift bin. Finally, the value of p Ura , at which one does not 
include objects in the calculation of (^iy (z)) and (log Ex (z)), also potentially impacts on the analysis. In the case of 
(logEjVi (z)) the absolute magnitude limit M y m translates into a limiting mass \ogM lirn . 

As will be shown in our subsequent analysis we find My" 1 = — 20 and log A4 Uln — 10. We will also provide further proof 
for the fact that the GEMS data are not limited in surface brightness even at the highest redshift bin. Furthermore, 
we will show that our results are fairly independent of the choice of p llm . We adopt a rather conservative value of 
p Ura — 0.5. In order to demonstrate these results, we calculate (^y (z)) and (logE^ (z)) for various combinations of 
p lim , /4 m , My™ and logAf im . 

In Fig. CI we plot (nv (z)) and (logEjvi (z)) as a function of the adopted p lim while holding My™ = —20, \ogA4 Urn = 
10 and /x'y m = oo constant. Both (fiy (z)) and (logEyn (z)) do not vary significantly, i.e. one would obtain the same 
results for {[iy (z)) and (log Ex (z)) using p Urn — 0.2 or p Ura — 0.8. The reason for this is two-fold. On one hand the 
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absolute magnitude limit My" 1 = — 20 and the lower stellar mass limit log.M llm — 10 are chosen rather conservatively, 
leading to a removal of almost all sources with p < 0.8. On the other hand, once the absolute magnitude limit reaches 
the region where the detection probability drops, galaxies fall along a line of constant apparent magnitude (see Fig. 4, 
at m z ~ 23.75 the 50% completeness contour is almost vertical). Thus, calculation of a mean surface brightness (or 
surface mass density) is evenly (un-) affected by the completeness correction (independent of surface brightness). Both 
these arguments arise from the fact that the GEMS data is not limited in surface brightness. 

We repeat this exercise for /x'y m (see Fig. C2). This time we hold p lim = 0.5, Mjj m = —20 and log.M lim = 10 fixed. 
We find that there is a characteristic surface brightness at each rcdshift at which the estimated values of {/ly (z)} 
and (log Y>m (z)) systematically start to deviate towards higher surface brightnesses. This has to be interpreted as 
the surface brightness at which one starts removing galaxies from the sample with the lowest surface brightness, thus 
shifting the average to higher surface brightnesses. Measuring constant mean values at the lowest surface brightnesses, 
however, does not necessarily imply that the average does not shift. It rather means that we run into our completeness 
limit eventually, i.e. we do not detect the galaxy population at all that might yet exist at such a faint level. The 
question is, whether we reach a plateau in (/zy (z)} or (logSx (z)} before we run into the GEMS surface brightness 
limit. To test this, we convert the apparent z-band surface brightness limit, i.e. where a completeness level of 50% is 
reached, as obtained from the GEMS completeness map ^pp> 50% ~ 23.9 into a rest-frame surface brightness limit /jy % 
in the V-band for each redshift bin using the following relation: 

with a redshift z dependent color term m v st — m z . The term — 10 log (1 + z) arises from the surface brightness dimming 
oc (1 + z) , which has to be accounted for when converting an apparent surface brightness to an absolute one. From 
a fit to the GEMS data we obtain: 

ray' - m z = M v - m z + 5 log D L + 25 

= 0.562 - 0.111 x z + 1.160 x z 2 - 0.841 x z 3 (C2) 

with the luminosity distance D^. The values obtained in this manner are indicated in Fig. C2 as vertical lines. We 
find that we can calculate the average galaxy population representing values for (/iy (z)} and (logS^ (z)) at z = 0.2 
down to a limiting surface brightness <~ 23.7. At higher rcdshift the corresponding value has dropped significantly, 
fj,y % <~ 21.7 at z = 1.0. Fortunately, even at that redshift we see that the average value (fiy (z)) has already flattened 
out, thus implying that even at the high redshift end of the GEMS survey we do sample the full distribution of surface 
brightnesses. The same line of reasoning also applies to (logS^ (z)). 

Finally, we examine the effect of the choice of M v m and log.A/f lim while holding p llm = 0.5 and /iy m = oo fixed (see 
Fig. C3). Similarly to Fig. C2 we overplot the 50% detection limit m~ z % <~ 23.7 at the faint magnitude end of the 
completeness map converted to a rest- frame absolute magnitude limit My % . Now the conversion reads: 

M y 0% = m° pp ' 50% + [my 1 - m z ] - 5 log D L - 25 

= ml PP - &0% + [M V -m z ] (C3) 

with the same definitions as above. We have not attempted to construct a similar relation for the case of log A4 lirn . We 
find that both (fiy (z)) and (logE^ (z)} vary systematically as a function of Mj} m and log.M llm . The reason for this 
is that the distribution of galaxies in the magnitude-size plane does not exactly fall along a line of constant surface 
brightness, but has a slightly steeper slope. This is most obvious at the lower redshift bins where we have the largest 
dynamic range in absolute manitudes. Therefore, our results are strictly true only for the adopted limiting magnitude 
My" — — 20. If one were to repeat our evaluation with deeper data, thus reaching fainter absolute limiting magnitudes 
at the highest redshift, one should expect to find slightly different absolute values for (fiy (z)} and (log Yim i z ))- 
However, if the distribution of galaxies does not change with redshift in the magnitude-size plane, which would imply 
differential evolution, one would measure the same relative differences. The same line of reasoning of course also holds 
for logAf' m = 10. 

The only way to circumvent this problem would be to move from measuring evolution in the surface brightness to 
a new variable p, which matches the observed slope of the low redshift population 16 . Fitting the slope in our lowest 
rcdshift bin, one reads off approximately logp oc M + 2.5 x 31ogi?, with magnitude M and radius R. This quantity 
has the physical dimensions of a volume density instead of a surface density, being proportional to the radius cubed. 
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Fig. 1. — The disk sample definition in the apparent magnitude-size plane (apparent effective radius: Re PP , apparent z-band magnitude: 
m z ). Top left: all galaxies detected in the GEMS tiles. Top right: galaxies with COMBO-17 rcdshifts. Bottom left: disk galaxies with 
Sersic index n < 2.5. Bottom right: disk galaxies with reliable GALFIT fits (see text for details on selection criteria). In each panel, we give 
the total number of galaxies in the upper right. 
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Fig. 2. — The average ratio of the disk scale length Rd (A) measured in various bands (B, V, R, I, H, K) over Rd (V) measured in the 
V-band for the de Jong (1996) data as a function of corresponding wavelength A (solid dots). The solid line marks a linear fit /, which 
is constrained to / (2.74) = 1 at the V-band (not strictly requiring coincidence with the data point at the V-band). Over the redshift 
range sampled by the GEMS data the size corrections as inferred from this plot are of order ±3%. Errors indicate the dispersion of the 
distribution of Rd (A) /Rd (V); they do not represent errors of the mean values plotted here. Solid boxes mark data points from the SDSS 
DR2 data set (see § 2.5 for details). Since there is no direct measurement in the V-band available for these objects, the data points arc 
simultaneously fit minimizing the total offset between SDSS and de Jong values using the g^B-band a s the reference filter. Open symbols 
represent measurements from the GEMS survey where in a certain redshift bin one of the filters matched the rest-frame V-band. 
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Fig. 3. — The GEMS (left panel) and the COMBO-17 (right panel) detection probabilites as a function of apparent z-band magnitude 
m z and apparent half-light radius R^ pp . The contours indicate different detection probability levels (from light to dark shades of grey: 
10%, 20%, 30%,... 90%). For comparison, in both plots a line of constant apparent surface brightness /i| pp = 24 mag arcsec -2 is shown. 




Fig. 4.— The combined GEMS / COMBO-17 completeness (dashed contours indicating 10%, 20%, 30%,... 90%) in the iC P -m z -planc. 
The solid contour shows the 50% completeness limit. Dots and pluses indicate disk galaxies with a combined GEMS detection / COMBO-17 
rcdshift estimation probability p > 50% and p < 50%, respectively. The panels show the completeness contours and data points for three 
different axis ratio ranges, indicated in the top of each plot. The diagonal line indicates a constant apparent surface brightness Hz PP = 24 
mag arcsec -2 . 
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Fig. 6. — The completeness as a function of redshift in the absolute magnitude-size plane. The same completeness contours and objects 
from the middle panel of Fig. 4 are plotted. The z ~ bin shows the SDSS data. The contours were computed for the central redshift of 
each bin as indicated in the top left of each panel. Galaxies with low detection probability are found along the absolute magnitude limit, 
but not at bright magnitudes and faint surface brightnesses. The box in each panel encloses a selection with fiy" = 22 mag arcsec" 2 and 
My" 1 = —20. This plot shows that the GEMS data (0.2 < z < 1.0) are not limited in surface brightness. Galaxies are observed in regions 
where the completeness contours indicate detection probabilities p < 5%. This is a result of the finite width of the redshift bins (see § 4.1 
and Fig. 7 for further explanations). 



GEMS: The Size Evolution of Disk Galaxies 



17 




Fig. 7. — The magnitude-size relation for six different redshift bins. The solid contours indicate If levels of fractional number of objects 
per unit area. To obtain the contours the density field was normalised by the total number of objects being plotted in each redshift bin 
and the following scaling was used: (exp(a;/2) — f) X 0.0005, with x = [1,2, 11]. The dotted contours are only shown for the lowest 
redshift bin (z ~ 0; SDSS data) for being too noisy in the other bins. The grey-scale contours mark the same completeness levels as shown 
in Fig. 6. The thin vertical lines in each panel symbolise an apparent magnitude m z = 24 converted to the rest-frame at three different 
rcdshifts corresponding to the center, low and high end of each redshift bin. This illustrates why apparently so many galaxies especially in 
the low redshift bin were detected at completeness values close to zero. The diagonal line in each panel corresponds to the average surface 
brightness fiy = 20.84 mag arcscc -2 in the lowest redshift bin. The thick vertical line represents the lower magnitude limit (My = —20), 
below which we exclude galaxies from the analysis. The thick dashed line shows the magnitude limit of the highest redshift SDSS data; at 
fainter magnitudes the VAGC data become incomplete. 
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Fig. 8. — Histograms of the absolute rest-frame surface brightness as a function of redshift (z ~ from SDSS; 0.2 < z < 1.0 from 
GEMS). The histograms include weighting of individual objects according to their completeness (y-axis peak normalized). Only objects 
with a completeness exceeding 50% and with My < —20 were included in order to minimize the impact of selection effects. The black 
circle marks the estimated mean value (fiy (z)} for each redshift bin. In each redshift bin with z > 0.0 we overplotted as open circles values 
of (fiy (z)) of all lower redshift bins to visualize the evolution in surface brightness. The vertical line in each panel indicates the Freeman 
(1970) surface brightness converted to the V-band (conversion given in Fukugita et al. 1995). The numbers in the top left of each panel 
shows the number of sources used in each panel (not including weighting). 
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Fig. 9. — Evolution of the average absolute surface brightness (fiy (2)). Solid dots show the GEMS data for the individual redshift bins; 
the box symbol at z ~ 0.05 indicates the SDSS data point. The error bars mark la statistical errors. The horizontal line at fiy = 21.05 
represents the Freeman (1970) surface brightness converted to the V-band. The solid and dotted lines mark a linear fit to all data points 
plus the la confidence limits, respectively. 
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Fig. 10. — The stellar mass— size relation for six different redshift bins (z ~ from SDSS; 0.2 < z < 1.0 from GEMS). The contours 
indicate levels of the same fractional number of objects per unit area as in Fig. 7. The vertical line in each panel marks log (A4/A4q) = 10, 
which corresponds to the limiting stellar mass logA'l 11111 applied to each redshift bin. The diagonal line in each panel corresponds to the 
average surface mass density measured from all redshift bins (logE^vt (2 = 1)} = 8.50. 
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Fig. 11. — Evolution of the average stellar surface mass density (logE^t (z)). Solid dots show the GEMS data for the individual rcdshift 
bins; the box symbol at z ~ 0.05 indicates the SDSS data point. The solid error bars indicate the 2<r statistical errors; the dotted error 
bar for the SDSS data point marks the 2<r systematic error resulting from the conversion of a mass-to-light ratio. The horizontal solid and 
dotted lines at (logS^vi (z)) = 8.50 ± 0.03 represent a linear fit to the data with a constant slope of zero and the la confidence limits, 
respectively. The diagonal line indicates the evolution as obtained from Mo ct al. (1998). 




Fig. 12. — Histograms of the stellar surface mass density as a function of rcdshift (z ~ from SDSS; 0.2 < z < 1.0 from GEMS). 
The histograms include weighting of individual objects according to their completeness (y-axis peak normalized). Only objects with a 
completeness exceeding 50% and with \og(M/ Mq) > log.M llm = 10 were included in order to minimize the impact of selection effects. 
The black circle marks the estimated mean value (logEjn (z)) for each redshift bin. In each rcdshift bin with z > 0.0 wc ovcrplotted as 
open circles values of (logEjn (z)) of all lower rcdshift bins to visualize the evolution in surface mass density. The numbers in the top left 
of each panel shows the number of sources used in each panel (not including weighting). 




Fig. 13. — Comparison of the evolution of the average absolute central i?-band surface brightness (/io,B (z)) with the literature. Solid 
dots show the GEMS data for the individual rcdshift bins; the box symbol at z ~ 0.05 indicates the SDSS data point. The error bars mark 
2<r statistical errors. The horizontal line at fig g = 21.65 represents the Freeman (1970) surface brightness. The solid and dotted lines 
mark a linear fit to all data points (including the SDSS value) plus the lcr confidence limits, respectively, a) Comparison with Simard et al. 
(1999, star symbols), b) Reproducing the Ravindranath et al. (2004) analysis. Triangles show values obtained from the GEMS data using 
the selection in limiting surface brightness and absolute magnitude as chosen by Ravindranath ct al. (2004). The open symbols indicate 
the impact of a population of high surface brightness galaxies at the highest redshift bin using the definition of Ravindranath et al. (open 
triangle, just barely visible above the filled triangle) and the definition by Simard et al. (open star). Especially when using the latter 
definition, the GEMS data with the Ravindranath et al. selection are consistent with no surface brightness evolution. 
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Fig. CI. — The average surface brightness (fiy (z)) (left panel) and the average surface mass density (log Ex (2)) (right panel) for 
the different redshift bins as a function of the adopted cut-off detection probability pii m . Horizontal dotted lines mark the means of each 
rcdshift bin. In both panels an absolute limiting magnitude M"j) m = —20 was applied. 



GEMS: The Size Evolution of Disk Galaxies 



21 



21 

19 
21 

19 
21 



E. 

a 19 



■i-U I 



z=0.2 



. • • • • 



z=0.4 



21 

19 
21 

19 



z=0.6 



z=0.8 



z=1.0 



l i • • l 



z=0.2 



8.5 



z=0.4 



1. 8.5 - • • • • • 



fcL 

cn 
o 



z=0.6 



z=0.8 



1.5 * 



z=1.0 



8-5 -Uiiii * 



20 22 24 26 20 22 24 26 



n7 [mag arcsec ] 



n7 [mag arcsec ] 



Fig. C2. — The average surface brightness {/iy (z)) (left panel) and the average surface mass density (log Ex (z)) (right panel) for the 
different redshift bins as a function of the adopted surface brightness limit /utj. m . Horizontal dotted lines mark the means of each rcdshift 
bin. The values at /i 1 ™ = 24 are plotted to demonstrate the results in the case were no fixed surface brightness limit (p}y m = oo) is chosen. 
In both panels an absolute limiting magnitude M[j m = —20 and a detection probability cut pii m = 0.5 was applied. The vertical lines 
correspond to the 50% completeness limiting surface brightness for an axis ratio q = 0.5 at the indicated redshift. 
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Fig. C3. — The average surface brightness (2)} (left panel) and the average surface mass density (log Ex (2)) (right panel) for the 
different redshift bins as a function of the adopted absolute magnitude limit Mj} m or the limiting stellar mass logA1 llm (left and right 
panel, respectively). Horizontal dotted lines mark the means of each redshift bin. The values at M 1 ™ = -18 (left) and logX lim = 9 
(right) arc plotted to demonstrate the results if no fixed absolute magnitude limit (M[} m = —00) or stellar mass limit (log _A/f llm = —00) 
is chosen. In both panels a detection probability cut pi; m = 0.5 was applied. The vertical lines in the left panel correspond to the 50% 
completeness limiting absolute magnitude at the indicated redshift. 



